Reasoning about Xml Schema Languages Using Formal Language Theory

نویسندگان

  • Dongwon Lee
  • Murali Mani
  • Makoto Murata
چکیده

A mathematical framework using formal language theory to describe and compare XML schema languages is presented. Our framework uses the work in two related areas { regular tree languages CDG + 97] and ambiguity in regular expressions BEGO71, BKW98]. Using these work as well as the content in two classical references HU79, AU79], we present the following results: (1) a normal form representation for regular tree grammars, (2) a framework of marked regular expressions and model groups, and their ambiguities, (3) ve subclasses of regular tree grammars and their corresponding languages to describe XML content models: regular tree languages, TD(1) (top-down input scan with 1-vertical lookahead), single-type constraint languages, TDLL(1) (top-down and left-right input scan with 1-vertical and 1-horizontal looka-heads), and local tree languages, (4) the closure properties of the ve language classes under boolean set operations, (5) a classiication and comparison of a few XML schema proposals and type systems: DTD, XML-Schema, DSD, XDuce, RELAX, and (6) properties of the grammar classes under two common operations: XML document validity checking and type resolution (i.e., XML document interpretation).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Foundations of XML Based on Logic and Automata: A Snapshot

XML query and schema languages have some obvious connections to Formal Language Theory. For example, Document Type Definitions (DTDs) can be viewed as tree grammars and use regular expressions, XML Schemas resemble tree automata. Likewise, there are immediate links to Logic, e.g., through the classical characterization of regular tree languages by monadic second-order logic. It is therefore not...

متن کامل

XML to Relational Conversion Using Theory of Regular Tree Grammars

In this paper, we study the different steps of translation from XML to relational models, while maintaining semantic constraints. Our work is based on the theory of regular tree grammars, which provides a useful formal framework for understanding various aspects of XML schema languages. We first study two normal form representations for regular tree grammars. The first normal form representatio...

متن کامل

Advances in Artificial Intelligence Theory

In recent years, the usual knowledge representation (KR) problem in artificial intelligence is how to automatically represent and transform different kinds of knowledge using one kind of schema. Especially this problem focuses on representing formal knowledge in natural language for human understanding. For this purpose, this paper proposes an extended super-function (ESF) schema to build a nov...

متن کامل

A Uni ed Constraint Model for XML

Integrity constraints are an essential part of modern schema de nition languages. They are useful for semantic speci cation, update consistency control, query optimization, etc. In this paper, we propose UCM, a model of integrity constraints for XML that is both simple and expressive. Because it relies on a single notion of keys and foreign keys, the UCMmodel is easy to use and makes formal rea...

متن کامل

PiDuce - A project for experimenting Web services technologies

The PiDuce project comprises a programming language and a distributed runtimeenvironment devised for experimenting Web services technologies by relying on solidtheories about process calculi and formal languages for XML documents and schemas.The language features values and datatypes that extend XML documents andschemas with channels, an expressive type system with subtyping, a ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000